Speeding up Gibbs sampling by variable grouping
نویسنده
چکیده
Gibbs sampling is a widely applicable inference technique that can in principle deal with complex multimodal distributions. Unfortunately, it fails in many practical applications due to slow convergence and abundance of local minima. In this paper, we propose a general method of speeding up Gibbs sampling in probabilistic models. The method works by introducing auxiliary variables which represent assignments of the original model variables to groups. Our experiments indicate that the groups converge early in the sampling. After they have converged, the original variables no longer need to be sampled, and it becomes possible to resample an entire group at a time, greatly speeding up the sampler. The proposed ideas are illustrated on LDA and are applicable to many other topic models.
منابع مشابه
Efficient Markov chain Monte Carlo with incomplete multinomial data
Multinomial Data By KWANG WOO AHN and KUNG-SIK CHAN Department of Statistics and Actuarial Science, The University of Iowa, Iowa City, IA USA [email protected] [email protected] Summary We propose a new, block Gibbs sampling scheme for incomplete multinomial data. The new approach facilitates maximal blocking, thereby reducing serial dependency and speeding up the convergence of the ...
متن کاملOnline Sparse Collapsed Hybrid Variational-Gibbs Algorithm for Hierarchical Dirichlet Process Topic Models
Topic models for text analysis are most commonly trained using either Gibbs sampling or variational Bayes. Recently, hybrid variational-Gibbs algorithms have been found to combine the best of both worlds. Variational algorithms are fast to converge and more efficient for inference on new documents. Gibbs sampling enables sparse updates since each token is only associated with one topic instead ...
متن کاملUsing Bayesian Networks to Identify the Causal Effect of Speeding in Individual Vehicle/Pedestrian Collisions
Estimating individual probabilities of causation generally requires prior knowledge of causal mechanisms. For traffic accidents such knowledge is often available and supports the discipline of accident reconstruction. In this paper structural knowledge is combined with Bayesian network methods to calculate the probability of necessity due to speeding for each of a set of vehicle/pedestrian coll...
متن کاملImproving Performance of Topic Models by Variable Grouping
Topic models have a wide range of applications, including modeling of text documents, images, user preferences, product rankings, and many others. However, learning optimal models may be difficult, especially for large problems. The reason is that inference techniques such as Gibbs sampling often converge to suboptimal models due to the abundance of local minima in large datasets. In this paper...
متن کاملTwo-Way Latent Grouping Model for User Preference Prediction
We introduce a novel latent grouping model for predicting the relevance of a new document to a user. The model assumes a latent group structure for both users and documents. We compared the model against a state-of-the-art method, the User Rating Profile model, where only users have a latent group structure. We estimate both models by Gibbs sampling. The new method predicts relevance more accur...
متن کامل